DETAILED ACTION
Claim Rejections - 35 USC § 102 

1. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that
form the basis for the rejections under this section made in this Office action:

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

2. Claims 1-8, 9-13 and 18-31 are rejected under 35 U.S.C. 102(e) as being
anticipated by Schulz (6,185,538).

Regarding claims 1 and 22, Schulz discloses a method for producing an
audiovisual work, the method comprising the steps of:

providing an audio signal to a speech recognition associating module (column 1,
line 60 to column 2, line 68; column 4, lines 25-49); receiving, collecting and
associating basic units of recognized speech and related time codes received from a
speech recognition module (Fig. 2; column 4, lines 35-47; column 5, lines 40-46);
processing the basic units to provide synchronization information for production of the
audiovisual work (column 2, lines 25-35); and displaying the synchronization
information on a user interface (Fig. 2).

Regarding claim 2, Schulz further teaches a graphic representation of a point of
sound to be performed (Fig. 2; column 6, line 57 to column 7, line 7).

Regarding claim 3, Schulz teaches wherein the basic units of recognized
speech are phonemes (column 4, lines 25-35).



Regarding claim 4, Schulz teaches the step of converting the basic units of
recognized speech received with the time codes into words and words related
time codes (column 4, lines 35-50).

Regarding claim 5, Schulz teaches the step of converting the basic units of
recognized speech received with the time codes into graphemes and graphemes 
related time codes, the graphemes being processed to provide synchronization 
information (Fig. 2, column 4, lines 35-50). 

Regarding claim 6, Schulz teaches the step of providing a conformed text
source, further wherein the synchronization information provided to the user
comprises an indication of a temporal location with respect to the audio signal
(column 6, line 67 to column 7, line 7).

Regarding claim 7, Schulz teaches the step of providing a script of at least one
part of the audio signal, further wherein the synchronization information provided 
to the user comprises an indication of a temporal location with respect to the 
script provided (column 5, lines 25-55). 

Regarding claim 8, Schulz teaches that the displaying on a user interface of said
synchronization information comprises the displaying of the graphemes using a
horizontally sizeable font (Fig. 2).

Regarding claim 10, Schulz teaches the step of amending at least one part of
the audio signal and audio signal related time codes using at least the
graphemes and the synchronization information, since Schulz teaches that the
audio can be edited using the time code and indication on a display (Fig. 2,
column 3).

Regarding claim 11, Schulz teaches providing of a plurality of words in
accordance with the provided audio signal, the providing being performed by an
operator (column 3, lines 15-35).

Regarding claim 12, Schulz teaches the step of amending a recognized word in
accordance with the plurality of words provided by the operator (column 3).

Regarding claim 13, Schulz teaches the step of creating a composite signal
comprising at least the amended word, a video signal related to the audio source
and the audio source (column 2, lines 5-20).

Regarding claim 19, Schulz further teaches an adaptation assisting interface that
comprises a graphic representation of the plurality of basic units of recognized
speech, the related time codes and a plurality of adapted basic units provided by a
user, said interface providing a visual indication of a matching of the plurality of
adapted basic units with the plurality of basic speech units, the matching
enabling synchronized adaptation of said audio signal (Fig. 2).

Regarding claim 20, Schulz teaches that the plurality of adapted basic units is
provided by performing a speech recognition of an adapted voice source
(column 4, lines 25-35).

Regarding claim 21, Schulz teaches that the speech recognition of the adapted voice
source further provides related adapted time codes, and that adapting the audio
signal using said synchronization information and the plurality of adapted basic
units is performed by attempting to match at least one of the plurality of basic
units with at least one of the plurality of adapted basic units using the related time
codes and the related adapted time codes (Fig. 2, column 4).

Regarding claim 23, Schulz teaches providing an indication of an amount of
successful replacement of the plurality of basic units of recognized speech of the
audio signal by the plurality of basic units of recognized speech of the adapted
audio signal (Fig. 2, column 2, lines 25-55).

Regarding claim 24, Schulz teaches the step of providing a minimum amount
required of successful replacement of the plurality of basic units of recognized
speech of the audio signal by the plurality of basic units of recognized speech of
the adapted audio signal, the method further comprising the step of canceling the
providing of the at least one replaced plurality of basic units with related replaced
time codes if the at least one replaced plurality of basic units is lower than the
minimum amount required of successful replacement, since Schulz teaches that
the audio, video and text can be edited (column 5, lines 5-25).

Regarding claim 25, Schulz teaches that the audio signal comprises a plurality of
voices originating from a plurality of actors, further comprising the step of
assigning each of the plurality of basic units and the related time codes to a
related actor of the plurality of actors (column 3, lines 13-24).

Regarding claim 26, Schulz teaches that the production comprises closed-captioning
production of the audio source, said closed-captioning comprises a graphic
representation of the recognized plurality of basic units, the method further
comprising the incorporating of at least one of the plurality of basic units as
closed-captioning in a visual or non-visual portion of the audio/video portion of
the audio/video signal in synchronization (Fig. 2, column 5, lines 25-45).

Regarding claim 27, Schulz teaches the step of amending at least one part of
the plurality of basic units.

Regarding claim 28, Schulz teaches converting the basic units of recognized
speech received with the time codes into words and words related time codes,
further comprising the step of creating a database comprising a word and related
basic units (column 5, lines 7-40).

Regarding claim 29, Schulz teaches amending a word of said database, wherein
phonemes of the word and the amended word are substantially the same 
(column 5, lines 7-25). 

Regarding claim 30, Schulz teaches the step of converting the basic units of
recognized speech received with the time codes into words and words related
time codes, further comprising the step of amending at least one word (column 5,
lines 7-25).

Regarding claim 31, Schulz teaches providing a visual indication of a word to
amend (column 5, lines 25-40). 

Regarding claim 33, Schulz teaches detecting at least one note encoded in the
audio signal according to an encoding scheme, further comprising the providing
of the detected at least one note on said graphic representation (Fig. 2, column
25-40, column 6, line 60 to column 7, line 7).

Claim Rejections - 35 USC § 103

3. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set
forth in section 102 of this title, if the differences between the subject matter sought to be patented and
the prior art are such that the subject matter as a whole would have been obvious at the time the
invention was made to a person having ordinary skill in the art to which said subject matter pertains.
Patentability shall not be negatived by the manner in which the invention was made.

This application currently names joint inventors. In considering patentability of
the claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of
the various claims was commonly owned at the time any inventions covered therein
were made absent any evidence to the contrary. Applicant is advised of the obligation
under 37 CFR 1.56 to point out the inventor and invention dates of each claim that was
not commonly owned at the time a later invention was made in order for the examiner to
consider the applicability of 35 U.S.C. 103(c) and potential 35 U.S.C. 102(e), (f) or (g)
prior art under 35 U.S.C. 103(a).

4. Claim 9 is rejected under 35 U.S.C. 103(a) as being unpatentable over Schulz
in view of Casey (20010044719 A).

Regarding claim 9, Schulz fails to specifically teach detecting a Foley in the audio
signal. Casey teaches detecting a Foley in an audio signal (section 0009). It would have
been obvious to one of ordinary skill in the art to modify Schulz with Foley detection by
using the teaching of Casey to detect a Foley in the audio signal, thereby accurately
indexing the audio signal.

5. Claims 14-18 are rejected under 35 U.S.C. 103(a) as being unpatentable over
Schulz in view of Lande et al. (6665643).

Regarding claims 14-15, Schulz fails to specifically teach using visems to produce
animation.

Lande teaches using visems in production of the animation. It would have been
obvious to one of ordinary skill in the art to modify Schulz with Lande by using the
teaching of Lande for using the visems to produce animation, thus enhancing the capacity
of the apparatus of Schulz.

Regarding claim 16, Schulz as modified with Lande further teaches providing a storyboard
database, further comprising the step of converting the basic units of recognized speech
received with the time codes into words and words related time codes, the processing of
the plurality of words and the words related time codes providing an indication of a
current temporal location of the audio signal with respect to the storyboard (Schulz, Fig.
2, column 2, lines 55-68).

Regarding claim 17, Schulz as modified with Lande teaches that the basic units of
recognized speech are phonemes, further comprising the step of providing a plurality of
visems for each of the plurality of words, using a visem database and using the
phonemes.

Regarding claim 18, Schulz as modified with Lande further teaches outputting an
adjusted voice track comprising the audio signal, at least one part of the storyboard and
the plurality of visems.

6. Claim 32 is rejected under 35 U.S.C. 103(a) as being unpatentable over Schulz
in view of Olmedo (6174170).

Regarding claim 32, Schulz fails to specifically teach that the audiovisual work
comprises karaoke generation. However, it is noted that generating a karaoke
having audio signal, lyrics and point time is well known in the art, as taught by Olmedo.
Therefore, it would have been obvious to one of ordinary skill in the art to modify
Schulz by using the teaching of Olmedo with the apparatus of Schulz to generate
the text, audio and video to form a karaoke with lyrics using the text, thereby
enhancing the capacity of the apparatus of Schulz.

7. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to HUY T. NGUYEN whose telephone number is (571) 
272-7378. The examiner can normally be reached from 8:30 AM to 6:00 PM.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's
supervisor, John W. Miller, can be reached on (571) 272-7353. The fax phone number
for the organization where this application or proceeding is assigned is 571-273-8300.

Information regarding the status of an application may be obtained from the
Patent Application Information Retrieval (PAIR) system. Status information for
published applications may be obtained from either Private PAIR or Public PAIR.
Status information for unpublished applications is available through Private PAIR only.
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should
you have questions on access to the Private PAIR system, contact the Electronic
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a
USPTO Customer Service Representative or access to the automated information
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.



